Amendments to the Claims:

This listing of claims will replace all prior versions, and listings, of claims in the 
application. 

Listing of Claims:

1-18. (Cancelled)

19. (Currently Amended) A method of processing a continuous audio stream
containing human speech from a plurality of speakers related to at least one particular 
transaction, comprising the steps of: 

identifying a known speaker from among the plurality of speakers;

digitizing the continuous audio stream; 

detecting a speaker change in the digitized audio stream; 

performing a speaker recognition if a speaker change is detected; and 

transcribing at least part of the continuous audio stream if the known speaker is 
recognized; 

wherein each speaker is processed using a different dictionary of different topics.

20. (Previously Presented) A method according to claim 19, comprising a 
further step of protocolling time information for detected speaker changes.

21. (Previously Presented) A method according to claim 19, wherein the step of
detecting a speaker change and/or the step of performing a speaker recognition is/are 
preceded by a further step of detecting non-speech boundaries between continuous speech 
segments. 

22. (Previously Presented) A method according to claim 19, wherein the step of
detecting a speaker change is accomplished by use of at least one characteristic audio 
feature, in particular features derived from the spectrum of the audio signal. 

23. (Previously Presented) A method according to claim 19, wherein the step of 
performing a speaker recognition involves the particular steps of calculating a speaker
signature from the audio stream and comparing the calculated speaker signature with at
least one known speaker signature. 

24. (Previously Presented) A method according to claim 19 for use in a speech 
recognition or voice control system comprising at least two speaker-specific speaker 
models and/or dictionaries, wherein interchanging between the at least two 
speaker-specific dictionaries is dependent on the detected speaker change and the 
corresponding recognized speaker. 

25. (Currently Amended) A method of processing a continuous audio stream 
containing human speech of a plurality of speakers related to at least one particular 
transaction, comprising the steps of:

identifying a known speaker from among the plurality of speakers;

digitizing the continuous audio stream; 

detecting a speaker change in the digitized audio stream; 

performing a speaker recognition if a speaker change is detected; and 

indexing the audio stream with respect to the detected speaker change if the 
known speaker is recognized;

wherein each speaker is processed using a different dictionary of different topics.

26. (Previously Presented) A method according to claim 25, comprising a 
further step of protocolling time information for detected speaker changes.

27. (Previously Presented) A method according to claim 25, wherein the step of 
detecting a speaker change and/or the step of performing a speaker recognition is/are 
preceded by a further step of detecting non-speech boundaries between continuous speech
segments. 

28. (Previously Presented) A method according to claim 25, wherein the step of 
detecting a speaker change is accomplished by use of at least one characteristic audio 
feature, in particular features derived from the spectrum of the audio signal.

29. (Previously Presented) A method according to claim 25, wherein the step of
performing a speaker recognition involves the particular steps of calculating a speaker
signature from the audio stream and comparing the calculated speaker signature with at
least one known speaker signature. 

30. (Previously Presented) A method according to claim 25 for use in a speech 
recognition or voice control system comprising at least two speaker-specific speaker 
models and/or dictionaries, wherein interchanging between the at least two 
speaker-specific dictionaries is dependent on the detected speaker change and the 
corresponding recognized speaker. 

31. (Currently Amended) An apparatus for processing a continuous audio 
stream containing human speech from a plurality of speakers related to at least one
particular transaction, comprising: 

a predeterminer which predetermines at least one known speaker from among the
plurality of speakers; 

a detector which detects speaker changes in the audio stream;

a recognizer which recognizes the predetermined speaker in the audio stream; and

an initiator which initiates transcription of at least part of the audio stream in case 
of a detected speaker change and a recognized predetermined known speaker;

wherein each speaker is processed using a different dictionary of different topics.

32. (Previously Presented) An apparatus according to claim 31, further 
comprising a detector which detects non-speech boundaries between continuous speech 
segments. 

33. (Previously Presented) An apparatus according to claim 31, further
comprising a scanner which automatically scans a continuous audio record, in particular a 
continuous audio stream recorded on a data or a signal carrier, and for detecting speaker 
changes in the continuous audio record.

34. (Previously Presented) An apparatus according to claim 31, further
comprising a monitor which continuously monitors a real-time continuous audio stream
and performing the steps of 

digitizing the continuous audio stream; 

detecting a speaker change in the digitized audio stream; 

performing a speaker recognition if a speaker change is detected; and 

transcribing at least part of the continuous audio stream if a predetermined 
speaker is recognized.

35. (Previously Presented) An apparatus according to claim 31, further
comprising a monitor which continuously monitors a real-time continuous audio stream 
and performing the steps of 

digitizing the continuous audio stream; 

detecting a speaker change in the digitized audio stream; 

performing a speaker recognition if a speaker change is detected; and 

indexing the audio stream with respect to the detected speaker change if a
predetermined speaker is recognized. 

36. (Previously Presented) An apparatus according to claim 31, further 
comprising a logging device which protocols time information for the at least one 
detected speaker change. 

37. (Previously Presented) An apparatus according to claim 31, comprising a 
marking device which marks at least the beginning of a detected speech segment related 
to a predetermined speaker.

38. (Previously Presented) An apparatus according to claim 31, comprising data 
base which stores speech signatures for at least two speakers.

39. (Currently Amended) An apparatus for processing a continuous audio
stream containing human speech from a plurality of speakers related to at least one 
particular transaction, comprising: 

a predeterminer which predetermines at least one known speaker from among the 
plurality of speakers; 

a detector which detects speaker changes in the audio stream; 

a recognizer which recognizes the predetermined speaker in the audio stream; and 

an indexer for indexing the audio stream dependent on a detected speaker change 
and a recognized predetermined speaker;

wherein each speaker is processed using a different dictionary of different topics.

40. (Previously Presented) An apparatus according to claim 39, further 
comprising a detector which detects non-speech boundaries between continuous speech 
segments. 

41. (Previously Presented) An apparatus according to claim 39, further 
comprising a scanner which automatically scans a continuous audio record, in particular a 
continuous audio stream recorded on a data or a signal carrier, and for detecting speaker
changes in the continuous audio record. 

PAGE 18/26 ' RCVD AT 10/3112005 9:24:38 PM [Eastern Standard Time] * SVR:USPTO-EF)(RF-6/26* DNIS:2738300* CSID:412 741 9292 * DURATION (min-ss]:05-08 



'• 10-31-' 05 22:24 FROM- 412-741-9292 T-053 P019/026 F-375 

Atty. Docket No. DE920000055US1 

(590.080) 

42. (Previously Presented) An apparatus according to claim 39, further 
comprising a monitor which continuously monitors a real-time continuous audio stream 
and performing the steps of 

digitizing the continuous audio stream; 

detecting a speaker change in the digitized audio stream; 

performing a speaker recognition if a speaker change is detected; and

transcribing at least part of the continuous audio stream if a predetermined 
speaker is recognized. 

43. (Previously Presented) An apparatus according to claim 39, further
comprising a monitor which continuously monitors a real-time continuous audio stream
and performing the steps of

digitizing the continuous audio stream; 

detecting a speaker change in the digitized audio stream; 

performing a speaker recognition if a speaker change is detected; and 

indexing the audio stream with respect to the detected speaker change if a 
predetermined speaker is recognized.

44. (Previously Presented) An apparatus according to claim 39, further
comprising a logging device which protocols time information for the at least one 
detected speaker change.

45. (Previously Presented) An apparatus according to claim 39, comprising a 
marking device which marks at least the beginning of a detected speech segment related 
to a predetermined speaker. 

46. (Previously Presented) An apparatus according to claim 39, comprising data
base which stores speech signatures for at least two speakers. 

47. (Previously Presented) A speech recognition or voice control system 
processing an incoming audio stream containing human speech from a plurality of
speakers and having at least two speaker models and/or speaker-specific dictionaries, 
comprising: 

a detector which detects a speaker change in the incoming audio stream; 

a gatherer which gathers speaker-specific information and for comparing the 
gathered speaker-specific information with corresponding speaker-specific information of 
at least one predetermined known speaker from among the plurality of speakers thus 
recognizing the at least one predetermined speaker; and

an interchanger which interchanges between the at least two speaker-specific
dictionaries dependent on the detected speaker change and the corresponding recognized 
speaker. 

48. (Currently Amended) A program storage device readable by machine, 
tangibly embodying a program of instructions executable by the machine to perform 
method steps for processing a continuous audio stream containing human speech from a 
plurality of speakers related to at least one particular transaction, said method comprising
the steps of: 

identifying a known speaker from among the plurality of speakers; 

digitizing the continuous audio stream;

detecting a speaker change in the digitized audio stream;

performing a speaker recognition if a speaker change is detected; and

transcribing at least part of the continuous audio stream if the known speaker is 
recognized;

wherein each speaker is processed using a different dictionary of different topics.

49. (Currently Amended) A program storage device readable by machine,
tangibly embodying a program of instructions executable by the machine to perform 
method steps for processing a continuous audio stream containing human speech from a
plurality of speakers related to at least one particular transaction, said method comprising
the steps of: 

identifying a known speaker from among the plurality of speakers; 

digitizing the continuous audio stream; 

detecting a speaker change in the digitized audio stream; 

performing a speaker recognition if a speaker change is detected; and 

indexing the audio stream with respect to the detected speaker change if the 
known speaker is recognized;

wherein each speaker is processed using a different dictionary of different topics.